A Developed Algorithm of Apriori Based on Association Analysis

نویسندگان

  • Li Pingxiang
  • Chen Jiangping
  • Bian Fuling
چکیده

Based on association analysis , an improved algorithm of Apriori is presented in the paper. The main idea of the algorithm are: (1) Count the probability of each attribute item(A1 , A2,...Am) of a DB by scanning the DB first time; (2)The probability of any two items Ak and Am appeared synchronously in one record is Pkm. min( Pk , Pm )≤Pkm ≤Pk *Pm , if Ak and Am is total correlation, then the Pkm is the minimum of the Pk and Pm,; if Ak and Am is total independent, then the Pkm is Pk *Pm; So we can estimate : Pkm =(a*min(Pk, Pm)+b*Pk*Pm)/(a+b); a+b=1 Parameter “a” is the probability while Ak and Am are total correlation, Parameter “b” is the probability while Ak and Am are total independent, Parameter “a” and “b” can use other method such as association analysis to count. In this paper a method for calculate the parameter “a” and ”b” with association analysis is provided. if Pkm is more than the threshold value which the user set, then Ak , Am are the frequent itemsets. You can use the method which described above to find out all the frequent itemsets without scanning DB so many times. (3)Count the support of the frequent itemsets by scanning the DB another time; (4)Output the association rules from the frequent itemsets. The detailed algorithm and it's sample are described in the paper . At last we compared it with algorithm apriori. The best quality is that the algorithm in our paper reduce the times of scanning DB.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

New Approaches to Analyze Gasoline Rationing

In this paper, the relation among factors in the road transportation sector from March, 2005 to March, 2011 is analyzed. Most of the previous studies have economical point of view on gasoline consumption. Here, a new approach is proposed in which different data mining techniques are used to extract meaningful relations between the aforementioned factors. The main and dependent factor is gasolin...

متن کامل

Mining the Banking Customer Behavior Using Clustering and Association Rules Methods

  The unprecedented growth of competition in the banking technology has raised the importance of retaining current customers and acquires new customers so that is important analyzing Customer behavior, which is base on bank databases. Analyzing bank databases for analyzing customer behavior is difficult since bank databases are multi-dimensional, comprised of monthly account records and daily t...

متن کامل

Fuzzy Apriori Rule Extraction Using Multi-Objective Particle Swarm Optimization: The Case of Credit Scoring

There are many methods introduced to solve the credit scoring problem such as support vector machines, neural networks and rule based classifiers. Rule bases are more favourite in credit decision making because of their ability to explicitly distinguish between good and bad applicants.In this paper multi-objective particle swarm is applied to optimize fuzzy apriori rule base in credit scoring. ...

متن کامل

Identifying Important Factors of Arthroplasty in Patients with Degenerative Knee Osteoarthritis Based on Association Rule Mining Approach

Background and Aim: Total Knee Arthroplasty (TKA) aims to reduce the pain and improve the quality of life of patients with progressive osteoarthritis. When the indication of patients' disease is established, this type of surgery should be performed as soon as possible because patients' late attendance increases surgical complications. Therefore, identification of factors influencing the choice ...

متن کامل

Applying a decision support system for accident analysis by using data mining approach: A case study on one of the Iranian manufactures

Uncertain and stochastic states have been always taken into consideration in the fields of risk management and accident, like other fields of industrial engineering, and have made decision making difficult and complicated for managers in corrective action selection and control measure approach. In this research, huge data sets of the accidents of a manufacturing and industrial unit have been st...

متن کامل

Fuzzy Apriori Rule Extraction Using Multi-Objective Particle Swarm Optimization: The Case of Credit Scoring

There are many methods introduced to solve the credit scoring problem such as support vector machines, neural networks and rule based classifiers. Rule bases are more favourite in credit decision making because of their ability to explicitly distinguish between good and bad applicants.In this paper multi-objective particle swarm is applied to optimize fuzzy apriori rule base in credit scoring. ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003